Review



read sequencing data  (Broad Clinical Labs)


Bioz Verified Symbol Broad Clinical Labs is a verified supplier
Bioz Manufacturer Symbol Broad Clinical Labs manufactures this product  
  • Logo
  • About
  • News
  • Press Release
  • Team
  • Advisors
  • Partners
  • Contact
  • Bioz Stars
  • Bioz vStars
  • 96

    Structured Review

    Broad Clinical Labs read sequencing data
    A. Principal components analysis of autochthonous KMC, KM/+C, and KPC tumors cluster into three distinctive groups B. Scatter plot with regression line demonstrating correlation between normalized Kras expression and Myc expression in murine tumors (Pearson r correlation coefficient = 0.74), with grouping of transcriptional clustering, irrespective of genotype. C. Heatmap demonstrating sample clustering by GSVA scoring of hallmark gene sets including several curated PDAc Kras specific signatures. Annotation includes copy number alteration for Kras and Myc, transcriptional clustering, PurIST subtyping by shrunken centroid, and model genotype. D. Copy number alteration plots of KMC mice from low pass whole genome <t>sequencing,</t> separated by transcriptional Cluster demonstrating diverse patterns of chromosomal instability in the KMC model; annotated with approximate location of canonical PDAc drivers and KMC alleles.
    Read Sequencing Data, supplied by Broad Clinical Labs, used in various techniques. Bioz Stars score: 96/100, based on 681 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
    https://www.bioz.com/result/read sequencing data/product/Broad Clinical Labs
    Average 96 stars, based on 681 article reviews
    read sequencing data - by Bioz Stars, 2026-04
    96/100 stars

    Images

    1) Product Images from "Myc and Kras cooperate in adult acinar cells to drive phenotypic heterogeneity, metastasis, and therapeutic resistance in a novel pancreatic cancer mouse model"

    Article Title: Myc and Kras cooperate in adult acinar cells to drive phenotypic heterogeneity, metastasis, and therapeutic resistance in a novel pancreatic cancer mouse model

    Journal: bioRxiv

    doi: 10.1101/2025.07.14.664767

    A. Principal components analysis of autochthonous KMC, KM/+C, and KPC tumors cluster into three distinctive groups B. Scatter plot with regression line demonstrating correlation between normalized Kras expression and Myc expression in murine tumors (Pearson r correlation coefficient = 0.74), with grouping of transcriptional clustering, irrespective of genotype. C. Heatmap demonstrating sample clustering by GSVA scoring of hallmark gene sets including several curated PDAc Kras specific signatures. Annotation includes copy number alteration for Kras and Myc, transcriptional clustering, PurIST subtyping by shrunken centroid, and model genotype. D. Copy number alteration plots of KMC mice from low pass whole genome sequencing, separated by transcriptional Cluster demonstrating diverse patterns of chromosomal instability in the KMC model; annotated with approximate location of canonical PDAc drivers and KMC alleles.
    Figure Legend Snippet: A. Principal components analysis of autochthonous KMC, KM/+C, and KPC tumors cluster into three distinctive groups B. Scatter plot with regression line demonstrating correlation between normalized Kras expression and Myc expression in murine tumors (Pearson r correlation coefficient = 0.74), with grouping of transcriptional clustering, irrespective of genotype. C. Heatmap demonstrating sample clustering by GSVA scoring of hallmark gene sets including several curated PDAc Kras specific signatures. Annotation includes copy number alteration for Kras and Myc, transcriptional clustering, PurIST subtyping by shrunken centroid, and model genotype. D. Copy number alteration plots of KMC mice from low pass whole genome sequencing, separated by transcriptional Cluster demonstrating diverse patterns of chromosomal instability in the KMC model; annotated with approximate location of canonical PDAc drivers and KMC alleles.

    Techniques Used: Expressing, Sequencing

    A. A circular dendrogram of primary human PDAc, KMC, and KPC tumors subjected to RNA sequencing, normalization, homolog mapping, and ComBat normalization. Histogram is by unsupervised Ward D2 clustering. Annotation rings (inner to outermost) describe receipt of chemotherapy (neoadjuvant = pink, adjuvant = blue), pORG score tertile, single-sample GSEA Myc V1 targets score, and survival quartile. Outer ring denotes subject number, colored for humans by PurIST subtyping (blue = basal-like, red = classical) and for mice by genotype (green = KMC, light blue = KM/+C, orange = KPC). The KMC transcriptional clusters (cf. fig 5A) are annotated.
    Figure Legend Snippet: A. A circular dendrogram of primary human PDAc, KMC, and KPC tumors subjected to RNA sequencing, normalization, homolog mapping, and ComBat normalization. Histogram is by unsupervised Ward D2 clustering. Annotation rings (inner to outermost) describe receipt of chemotherapy (neoadjuvant = pink, adjuvant = blue), pORG score tertile, single-sample GSEA Myc V1 targets score, and survival quartile. Outer ring denotes subject number, colored for humans by PurIST subtyping (blue = basal-like, red = classical) and for mice by genotype (green = KMC, light blue = KM/+C, orange = KPC). The KMC transcriptional clusters (cf. fig 5A) are annotated.

    Techniques Used: RNA Sequencing, Adjuvant



    Similar Products

    99
    TaKaRa data long read ont sequencing fastq files ncbi s
    Data Long Read Ont Sequencing Fastq Files Ncbi S, supplied by TaKaRa, used in various techniques. Bioz Stars score: 99/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
    https://www.bioz.com/result/data long read ont sequencing fastq files ncbi s/product/TaKaRa
    Average 99 stars, based on 1 article reviews
    data long read ont sequencing fastq files ncbi s - by Bioz Stars, 2026-04
    99/100 stars
      Buy from Supplier

    96
    Broad Clinical Labs read sequencing data
    A. Principal components analysis of autochthonous KMC, KM/+C, and KPC tumors cluster into three distinctive groups B. Scatter plot with regression line demonstrating correlation between normalized Kras expression and Myc expression in murine tumors (Pearson r correlation coefficient = 0.74), with grouping of transcriptional clustering, irrespective of genotype. C. Heatmap demonstrating sample clustering by GSVA scoring of hallmark gene sets including several curated PDAc Kras specific signatures. Annotation includes copy number alteration for Kras and Myc, transcriptional clustering, PurIST subtyping by shrunken centroid, and model genotype. D. Copy number alteration plots of KMC mice from low pass whole genome <t>sequencing,</t> separated by transcriptional Cluster demonstrating diverse patterns of chromosomal instability in the KMC model; annotated with approximate location of canonical PDAc drivers and KMC alleles.
    Read Sequencing Data, supplied by Broad Clinical Labs, used in various techniques. Bioz Stars score: 96/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
    https://www.bioz.com/result/read sequencing data/product/Broad Clinical Labs
    Average 96 stars, based on 1 article reviews
    read sequencing data - by Bioz Stars, 2026-04
    96/100 stars
      Buy from Supplier

    90
    Pacific Biosciences long-read pacific biosciences high-fidelity (hifi) sequencing data
    A. Example IGV views of a germline and postzygotic mutation in <t>HiFi</t> read data. B. The number of autosomal de novo germline and postzygotic SNVs, insertions and deletions <50 bp, and tandem repeat mutations observed in each sample. Sibling pairs are grouped by family and highlighted in blue, with the proband above the sibling. C. Upset plot of origin assignment shows concordance between HiFi haplotypes, ONT haplotypes, and allele balance. D. Allele balance distribution for autosomal germline and postzygotic SNVs across PacBio HiFi, Illumina, and ONT read data. E. Distribution of the size of autosomal insertions, deletions, and tandem repeat mutations.
    Long Read Pacific Biosciences High Fidelity (Hifi) Sequencing Data, supplied by Pacific Biosciences, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
    https://www.bioz.com/result/long-read pacific biosciences high-fidelity (hifi) sequencing data/product/Pacific Biosciences
    Average 90 stars, based on 1 article reviews
    long-read pacific biosciences high-fidelity (hifi) sequencing data - by Bioz Stars, 2026-04
    90/100 stars
      Buy from Supplier

    90
    Illumina Inc short read sequencing data
    Analysis of telomere repeat k-mers in long read sequences from A. thaliana Col-0. (A) Example Type I, II, and III reads from A. thaliana Col-0 Nanopore reads. Each color represents an exact match to the telomere repeat AAACCCT and the position found across the <t>sequencing</t> read. Reads are shown so the 5’ end of the sequence is enriched for the AAACCCT repeat and reads with 3’ end enriched for the complement sequence TTTGGGA we show the reverse complement sequence. (B) Co-occurrence heatmap displaying the frequency of a telomere repeat 5-mer (original telomere repeat sequence is AAACCCT) with all possible dinucleotide sequences that can be found at the end of the 5-mer. Top shows frequencies from analyzing reads aligning to chromosome 1R for Nanopore reads and bottom show frequencies from PacBio reads aligning to chromosome 1R. (C) An example Type III Nanopore sequencing read displaying the occurrence of a 4-mer and 5-mer of the original telomere repeat sequence AAACCCT (Top). A sliding window analysis where each window is size 100 bp and it slides 7 bp (bottom). In the window the average k-mer count is calculated and a change point detection method is applied to determine the window where there is a drop in telomere repeat count (red dotted line). (D) Telomere length estimates for Nanopore sequencing reads aligning to chromosome 1L and 1R.
    Short Read Sequencing Data, supplied by Illumina Inc, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
    https://www.bioz.com/result/short read sequencing data/product/Illumina Inc
    Average 90 stars, based on 1 article reviews
    short read sequencing data - by Bioz Stars, 2026-04
    90/100 stars
      Buy from Supplier

    90
    Illumina Inc whole-genome sequencing raw data illumina reads
    Analysis of telomere repeat k-mers in long read sequences from A. thaliana Col-0. (A) Example Type I, II, and III reads from A. thaliana Col-0 Nanopore reads. Each color represents an exact match to the telomere repeat AAACCCT and the position found across the <t>sequencing</t> read. Reads are shown so the 5’ end of the sequence is enriched for the AAACCCT repeat and reads with 3’ end enriched for the complement sequence TTTGGGA we show the reverse complement sequence. (B) Co-occurrence heatmap displaying the frequency of a telomere repeat 5-mer (original telomere repeat sequence is AAACCCT) with all possible dinucleotide sequences that can be found at the end of the 5-mer. Top shows frequencies from analyzing reads aligning to chromosome 1R for Nanopore reads and bottom show frequencies from PacBio reads aligning to chromosome 1R. (C) An example Type III Nanopore sequencing read displaying the occurrence of a 4-mer and 5-mer of the original telomere repeat sequence AAACCCT (Top). A sliding window analysis where each window is size 100 bp and it slides 7 bp (bottom). In the window the average k-mer count is calculated and a change point detection method is applied to determine the window where there is a drop in telomere repeat count (red dotted line). (D) Telomere length estimates for Nanopore sequencing reads aligning to chromosome 1L and 1R.
    Whole Genome Sequencing Raw Data Illumina Reads, supplied by Illumina Inc, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
    https://www.bioz.com/result/whole-genome sequencing raw data illumina reads/product/Illumina Inc
    Average 90 stars, based on 1 article reviews
    whole-genome sequencing raw data illumina reads - by Bioz Stars, 2026-04
    90/100 stars
      Buy from Supplier

    90
    Illumina Inc sequence read analysis data
    Analysis of telomere repeat k-mers in long read sequences from A. thaliana Col-0. (A) Example Type I, II, and III reads from A. thaliana Col-0 Nanopore reads. Each color represents an exact match to the telomere repeat AAACCCT and the position found across the <t>sequencing</t> read. Reads are shown so the 5’ end of the sequence is enriched for the AAACCCT repeat and reads with 3’ end enriched for the complement sequence TTTGGGA we show the reverse complement sequence. (B) Co-occurrence heatmap displaying the frequency of a telomere repeat 5-mer (original telomere repeat sequence is AAACCCT) with all possible dinucleotide sequences that can be found at the end of the 5-mer. Top shows frequencies from analyzing reads aligning to chromosome 1R for Nanopore reads and bottom show frequencies from PacBio reads aligning to chromosome 1R. (C) An example Type III Nanopore sequencing read displaying the occurrence of a 4-mer and 5-mer of the original telomere repeat sequence AAACCCT (Top). A sliding window analysis where each window is size 100 bp and it slides 7 bp (bottom). In the window the average k-mer count is calculated and a change point detection method is applied to determine the window where there is a drop in telomere repeat count (red dotted line). (D) Telomere length estimates for Nanopore sequencing reads aligning to chromosome 1L and 1R.
    Sequence Read Analysis Data, supplied by Illumina Inc, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
    https://www.bioz.com/result/sequence read analysis data/product/Illumina Inc
    Average 90 stars, based on 1 article reviews
    sequence read analysis data - by Bioz Stars, 2026-04
    90/100 stars
      Buy from Supplier

    90
    Illumina Inc short read rna-sequencing data
    Analysis of telomere repeat k-mers in long read sequences from A. thaliana Col-0. (A) Example Type I, II, and III reads from A. thaliana Col-0 Nanopore reads. Each color represents an exact match to the telomere repeat AAACCCT and the position found across the <t>sequencing</t> read. Reads are shown so the 5’ end of the sequence is enriched for the AAACCCT repeat and reads with 3’ end enriched for the complement sequence TTTGGGA we show the reverse complement sequence. (B) Co-occurrence heatmap displaying the frequency of a telomere repeat 5-mer (original telomere repeat sequence is AAACCCT) with all possible dinucleotide sequences that can be found at the end of the 5-mer. Top shows frequencies from analyzing reads aligning to chromosome 1R for Nanopore reads and bottom show frequencies from PacBio reads aligning to chromosome 1R. (C) An example Type III Nanopore sequencing read displaying the occurrence of a 4-mer and 5-mer of the original telomere repeat sequence AAACCCT (Top). A sliding window analysis where each window is size 100 bp and it slides 7 bp (bottom). In the window the average k-mer count is calculated and a change point detection method is applied to determine the window where there is a drop in telomere repeat count (red dotted line). (D) Telomere length estimates for Nanopore sequencing reads aligning to chromosome 1L and 1R.
    Short Read Rna Sequencing Data, supplied by Illumina Inc, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
    https://www.bioz.com/result/short read rna-sequencing data/product/Illumina Inc
    Average 90 stars, based on 1 article reviews
    short read rna-sequencing data - by Bioz Stars, 2026-04
    90/100 stars
      Buy from Supplier

    90
    Oxford Nanopore long read rna-sequencing data
    Analysis of telomere repeat k-mers in long read sequences from A. thaliana Col-0. (A) Example Type I, II, and III reads from A. thaliana Col-0 Nanopore reads. Each color represents an exact match to the telomere repeat AAACCCT and the position found across the <t>sequencing</t> read. Reads are shown so the 5’ end of the sequence is enriched for the AAACCCT repeat and reads with 3’ end enriched for the complement sequence TTTGGGA we show the reverse complement sequence. (B) Co-occurrence heatmap displaying the frequency of a telomere repeat 5-mer (original telomere repeat sequence is AAACCCT) with all possible dinucleotide sequences that can be found at the end of the 5-mer. Top shows frequencies from analyzing reads aligning to chromosome 1R for Nanopore reads and bottom show frequencies from PacBio reads aligning to chromosome 1R. (C) An example Type III Nanopore sequencing read displaying the occurrence of a 4-mer and 5-mer of the original telomere repeat sequence AAACCCT (Top). A sliding window analysis where each window is size 100 bp and it slides 7 bp (bottom). In the window the average k-mer count is calculated and a change point detection method is applied to determine the window where there is a drop in telomere repeat count (red dotted line). (D) Telomere length estimates for Nanopore sequencing reads aligning to chromosome 1L and 1R.
    Long Read Rna Sequencing Data, supplied by Oxford Nanopore, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
    https://www.bioz.com/result/long read rna-sequencing data/product/Oxford Nanopore
    Average 90 stars, based on 1 article reviews
    long read rna-sequencing data - by Bioz Stars, 2026-04
    90/100 stars
      Buy from Supplier

    90
    Illumina Inc sequence read data
    Analysis of telomere repeat k-mers in long read sequences from A. thaliana Col-0. (A) Example Type I, II, and III reads from A. thaliana Col-0 Nanopore reads. Each color represents an exact match to the telomere repeat AAACCCT and the position found across the <t>sequencing</t> read. Reads are shown so the 5’ end of the sequence is enriched for the AAACCCT repeat and reads with 3’ end enriched for the complement sequence TTTGGGA we show the reverse complement sequence. (B) Co-occurrence heatmap displaying the frequency of a telomere repeat 5-mer (original telomere repeat sequence is AAACCCT) with all possible dinucleotide sequences that can be found at the end of the 5-mer. Top shows frequencies from analyzing reads aligning to chromosome 1R for Nanopore reads and bottom show frequencies from PacBio reads aligning to chromosome 1R. (C) An example Type III Nanopore sequencing read displaying the occurrence of a 4-mer and 5-mer of the original telomere repeat sequence AAACCCT (Top). A sliding window analysis where each window is size 100 bp and it slides 7 bp (bottom). In the window the average k-mer count is calculated and a change point detection method is applied to determine the window where there is a drop in telomere repeat count (red dotted line). (D) Telomere length estimates for Nanopore sequencing reads aligning to chromosome 1L and 1R.
    Sequence Read Data, supplied by Illumina Inc, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
    https://www.bioz.com/result/sequence read data/product/Illumina Inc
    Average 90 stars, based on 1 article reviews
    sequence read data - by Bioz Stars, 2026-04
    90/100 stars
      Buy from Supplier

    Image Search Results


    A. Principal components analysis of autochthonous KMC, KM/+C, and KPC tumors cluster into three distinctive groups B. Scatter plot with regression line demonstrating correlation between normalized Kras expression and Myc expression in murine tumors (Pearson r correlation coefficient = 0.74), with grouping of transcriptional clustering, irrespective of genotype. C. Heatmap demonstrating sample clustering by GSVA scoring of hallmark gene sets including several curated PDAc Kras specific signatures. Annotation includes copy number alteration for Kras and Myc, transcriptional clustering, PurIST subtyping by shrunken centroid, and model genotype. D. Copy number alteration plots of KMC mice from low pass whole genome sequencing, separated by transcriptional Cluster demonstrating diverse patterns of chromosomal instability in the KMC model; annotated with approximate location of canonical PDAc drivers and KMC alleles.

    Journal: bioRxiv

    Article Title: Myc and Kras cooperate in adult acinar cells to drive phenotypic heterogeneity, metastasis, and therapeutic resistance in a novel pancreatic cancer mouse model

    doi: 10.1101/2025.07.14.664767

    Figure Lengend Snippet: A. Principal components analysis of autochthonous KMC, KM/+C, and KPC tumors cluster into three distinctive groups B. Scatter plot with regression line demonstrating correlation between normalized Kras expression and Myc expression in murine tumors (Pearson r correlation coefficient = 0.74), with grouping of transcriptional clustering, irrespective of genotype. C. Heatmap demonstrating sample clustering by GSVA scoring of hallmark gene sets including several curated PDAc Kras specific signatures. Annotation includes copy number alteration for Kras and Myc, transcriptional clustering, PurIST subtyping by shrunken centroid, and model genotype. D. Copy number alteration plots of KMC mice from low pass whole genome sequencing, separated by transcriptional Cluster demonstrating diverse patterns of chromosomal instability in the KMC model; annotated with approximate location of canonical PDAc drivers and KMC alleles.

    Article Snippet: Low-pass whole genome sequencing was performed on tumor/normal pairs first by aligning short- read sequencing data with BWA mem with default settings., Second, ichorCNA ( https://github.com/broadinstitute/ichorCNA , Broad Institute, Cambridge, MA) was used to predict CNVs and estimate tumor fraction with default settings.

    Techniques: Expressing, Sequencing

    A. A circular dendrogram of primary human PDAc, KMC, and KPC tumors subjected to RNA sequencing, normalization, homolog mapping, and ComBat normalization. Histogram is by unsupervised Ward D2 clustering. Annotation rings (inner to outermost) describe receipt of chemotherapy (neoadjuvant = pink, adjuvant = blue), pORG score tertile, single-sample GSEA Myc V1 targets score, and survival quartile. Outer ring denotes subject number, colored for humans by PurIST subtyping (blue = basal-like, red = classical) and for mice by genotype (green = KMC, light blue = KM/+C, orange = KPC). The KMC transcriptional clusters (cf. fig 5A) are annotated.

    Journal: bioRxiv

    Article Title: Myc and Kras cooperate in adult acinar cells to drive phenotypic heterogeneity, metastasis, and therapeutic resistance in a novel pancreatic cancer mouse model

    doi: 10.1101/2025.07.14.664767

    Figure Lengend Snippet: A. A circular dendrogram of primary human PDAc, KMC, and KPC tumors subjected to RNA sequencing, normalization, homolog mapping, and ComBat normalization. Histogram is by unsupervised Ward D2 clustering. Annotation rings (inner to outermost) describe receipt of chemotherapy (neoadjuvant = pink, adjuvant = blue), pORG score tertile, single-sample GSEA Myc V1 targets score, and survival quartile. Outer ring denotes subject number, colored for humans by PurIST subtyping (blue = basal-like, red = classical) and for mice by genotype (green = KMC, light blue = KM/+C, orange = KPC). The KMC transcriptional clusters (cf. fig 5A) are annotated.

    Article Snippet: Low-pass whole genome sequencing was performed on tumor/normal pairs first by aligning short- read sequencing data with BWA mem with default settings., Second, ichorCNA ( https://github.com/broadinstitute/ichorCNA , Broad Institute, Cambridge, MA) was used to predict CNVs and estimate tumor fraction with default settings.

    Techniques: RNA Sequencing, Adjuvant

    A. Example IGV views of a germline and postzygotic mutation in HiFi read data. B. The number of autosomal de novo germline and postzygotic SNVs, insertions and deletions <50 bp, and tandem repeat mutations observed in each sample. Sibling pairs are grouped by family and highlighted in blue, with the proband above the sibling. C. Upset plot of origin assignment shows concordance between HiFi haplotypes, ONT haplotypes, and allele balance. D. Allele balance distribution for autosomal germline and postzygotic SNVs across PacBio HiFi, Illumina, and ONT read data. E. Distribution of the size of autosomal insertions, deletions, and tandem repeat mutations.

    Journal: bioRxiv

    Article Title: Long-read sequencing of trios reveals increased germline and postzygotic mutation rates in repetitive DNA

    doi: 10.1101/2025.07.18.665621

    Figure Lengend Snippet: A. Example IGV views of a germline and postzygotic mutation in HiFi read data. B. The number of autosomal de novo germline and postzygotic SNVs, insertions and deletions <50 bp, and tandem repeat mutations observed in each sample. Sibling pairs are grouped by family and highlighted in blue, with the proband above the sibling. C. Upset plot of origin assignment shows concordance between HiFi haplotypes, ONT haplotypes, and allele balance. D. Allele balance distribution for autosomal germline and postzygotic SNVs across PacBio HiFi, Illumina, and ONT read data. E. Distribution of the size of autosomal insertions, deletions, and tandem repeat mutations.

    Article Snippet: We leveraged long-read Pacific Biosciences high-fidelity (HiFi) sequencing data derived from blood for variant discovery, and both long-read Oxford Nanopore Technologies (ONT) and short-read Illumina data for validation purposes.

    Techniques: Mutagenesis

    Analysis of telomere repeat k-mers in long read sequences from A. thaliana Col-0. (A) Example Type I, II, and III reads from A. thaliana Col-0 Nanopore reads. Each color represents an exact match to the telomere repeat AAACCCT and the position found across the sequencing read. Reads are shown so the 5’ end of the sequence is enriched for the AAACCCT repeat and reads with 3’ end enriched for the complement sequence TTTGGGA we show the reverse complement sequence. (B) Co-occurrence heatmap displaying the frequency of a telomere repeat 5-mer (original telomere repeat sequence is AAACCCT) with all possible dinucleotide sequences that can be found at the end of the 5-mer. Top shows frequencies from analyzing reads aligning to chromosome 1R for Nanopore reads and bottom show frequencies from PacBio reads aligning to chromosome 1R. (C) An example Type III Nanopore sequencing read displaying the occurrence of a 4-mer and 5-mer of the original telomere repeat sequence AAACCCT (Top). A sliding window analysis where each window is size 100 bp and it slides 7 bp (bottom). In the window the average k-mer count is calculated and a change point detection method is applied to determine the window where there is a drop in telomere repeat count (red dotted line). (D) Telomere length estimates for Nanopore sequencing reads aligning to chromosome 1L and 1R.

    Journal: bioRxiv

    Article Title: Topsicle: a method for estimating telomere length from whole genome long-read sequencing data

    doi: 10.1101/2025.07.10.664126

    Figure Lengend Snippet: Analysis of telomere repeat k-mers in long read sequences from A. thaliana Col-0. (A) Example Type I, II, and III reads from A. thaliana Col-0 Nanopore reads. Each color represents an exact match to the telomere repeat AAACCCT and the position found across the sequencing read. Reads are shown so the 5’ end of the sequence is enriched for the AAACCCT repeat and reads with 3’ end enriched for the complement sequence TTTGGGA we show the reverse complement sequence. (B) Co-occurrence heatmap displaying the frequency of a telomere repeat 5-mer (original telomere repeat sequence is AAACCCT) with all possible dinucleotide sequences that can be found at the end of the 5-mer. Top shows frequencies from analyzing reads aligning to chromosome 1R for Nanopore reads and bottom show frequencies from PacBio reads aligning to chromosome 1R. (C) An example Type III Nanopore sequencing read displaying the occurrence of a 4-mer and 5-mer of the original telomere repeat sequence AAACCCT (Top). A sliding window analysis where each window is size 100 bp and it slides 7 bp (bottom). In the window the average k-mer count is calculated and a change point detection method is applied to determine the window where there is a drop in telomere repeat count (red dotted line). (D) Telomere length estimates for Nanopore sequencing reads aligning to chromosome 1L and 1R.

    Article Snippet: Methods such as TelSeq [ ], K-seek [ , ], Computel [ ], Telomerecat [ ], and TelomereHunter [ ] have been used to analyze short read sequencing data (usually generated from the Illumina sequencing platform) for estimating the telomere length of a sample using an approach analogous to a qPCR based method.

    Techniques: Sequencing, Nanopore Sequencing

    Distribution of Telomere Repeat Count (TRC) values from raw long sequences. (A) TRC values for A. thaliana Col-0 Nanopore reads (ERR11436636) that were visually categorized as Type I, II, or III reads. TRC values were calculated using the 4-mers from the telomere repeat sequence AAACCCT. (B) TRC values for A. thaliana Col-0 Nanopore reads aligning to chromosome ends or from all sequencing reads. TRC values were calculated using the 4-mers from the telomere repeat sequence AAACCCT. (C) TRC values for maize B73 PacBio reads aligning to chromosome ends or from all sequencing reads. For maize the TRC values were calculated using the reads from a single sequencing library (ERR3288278) out of a total 18 libraries sequenced for B73. TRC values were calculated using the 4-mers from the telomere repeat sequence AAACCCT. (D) TRC values for Maize B73 PacBio reads aligning to chromosome ends or from all sequencing reads. TRC values were calculated using the 5-mers from the telomere repeat sequence AAACCCT. (E) Telomere length estimates from the A. thaliana Col-0 Nanopore reads using various TRC value cutoff and k-mer sizes. (F) Telomere length estimates from the maize B73 PacBio reads using various TRC value cutoff and k-mer sizes.

    Journal: bioRxiv

    Article Title: Topsicle: a method for estimating telomere length from whole genome long-read sequencing data

    doi: 10.1101/2025.07.10.664126

    Figure Lengend Snippet: Distribution of Telomere Repeat Count (TRC) values from raw long sequences. (A) TRC values for A. thaliana Col-0 Nanopore reads (ERR11436636) that were visually categorized as Type I, II, or III reads. TRC values were calculated using the 4-mers from the telomere repeat sequence AAACCCT. (B) TRC values for A. thaliana Col-0 Nanopore reads aligning to chromosome ends or from all sequencing reads. TRC values were calculated using the 4-mers from the telomere repeat sequence AAACCCT. (C) TRC values for maize B73 PacBio reads aligning to chromosome ends or from all sequencing reads. For maize the TRC values were calculated using the reads from a single sequencing library (ERR3288278) out of a total 18 libraries sequenced for B73. TRC values were calculated using the 4-mers from the telomere repeat sequence AAACCCT. (D) TRC values for Maize B73 PacBio reads aligning to chromosome ends or from all sequencing reads. TRC values were calculated using the 5-mers from the telomere repeat sequence AAACCCT. (E) Telomere length estimates from the A. thaliana Col-0 Nanopore reads using various TRC value cutoff and k-mer sizes. (F) Telomere length estimates from the maize B73 PacBio reads using various TRC value cutoff and k-mer sizes.

    Article Snippet: Methods such as TelSeq [ ], K-seek [ , ], Computel [ ], Telomerecat [ ], and TelomereHunter [ ] have been used to analyze short read sequencing data (usually generated from the Illumina sequencing platform) for estimating the telomere length of a sample using an approach analogous to a qPCR based method.

    Techniques: Sequencing

    Overview of Topsicle estimating telomere length from long read sequencing data.

    Journal: bioRxiv

    Article Title: Topsicle: a method for estimating telomere length from whole genome long-read sequencing data

    doi: 10.1101/2025.07.10.664126

    Figure Lengend Snippet: Overview of Topsicle estimating telomere length from long read sequencing data.

    Article Snippet: Methods such as TelSeq [ ], K-seek [ , ], Computel [ ], Telomerecat [ ], and TelomereHunter [ ] have been used to analyze short read sequencing data (usually generated from the Illumina sequencing platform) for estimating the telomere length of a sample using an approach analogous to a qPCR based method.

    Techniques: Sequencing

    Applying Topsicle on simulated dataset. (A) Telomere length was estimated with Topsicle using the 4-mer and TRC value cutoff of 0.4 on 30 simulated reads with error rates of 10%, 20%, and 30% for reads with varying length and proportion of telomere repeat. For each simulation the read length is indicated on the right side of the bar (“|”) and the length of the telomere repeat is on the left side of the bar. (B) Coverage simulation by random sampling the whole genome sequencing data and using Topsicle to estimate telomere length. For each coverage the random sampling was done 20 times and Topsicle was used to estimate the telomere length using the 4-mer telomere repeat and TRC value cutoff of 0.4 to select for telomere reads. Each point represents the median telomere length from the sampled dataset.

    Journal: bioRxiv

    Article Title: Topsicle: a method for estimating telomere length from whole genome long-read sequencing data

    doi: 10.1101/2025.07.10.664126

    Figure Lengend Snippet: Applying Topsicle on simulated dataset. (A) Telomere length was estimated with Topsicle using the 4-mer and TRC value cutoff of 0.4 on 30 simulated reads with error rates of 10%, 20%, and 30% for reads with varying length and proportion of telomere repeat. For each simulation the read length is indicated on the right side of the bar (“|”) and the length of the telomere repeat is on the left side of the bar. (B) Coverage simulation by random sampling the whole genome sequencing data and using Topsicle to estimate telomere length. For each coverage the random sampling was done 20 times and Topsicle was used to estimate the telomere length using the 4-mer telomere repeat and TRC value cutoff of 0.4 to select for telomere reads. Each point represents the median telomere length from the sampled dataset.

    Article Snippet: Methods such as TelSeq [ ], K-seek [ , ], Computel [ ], Telomerecat [ ], and TelomereHunter [ ] have been used to analyze short read sequencing data (usually generated from the Illumina sequencing platform) for estimating the telomere length of a sample using an approach analogous to a qPCR based method.

    Techniques: Sampling, Sequencing

    Application of Topsicle on A. thaliana long read sequencing dataset. (A) Distribution of telomere length estimates from analyzing whole genome long read sequencing data of 104 A. thaliana ecotypes. Red dotted line indicates the median (2815 bp). (B) Scatter plot of telomere lengths from 31 ecotypes that have telomere length estimates from TRF and Topsicle. Pearson’s r is shown in the upper-left side and the line of best fit is shown in red line (TRF telomere length = 1.71 ξ Topsicle telomere length – 1118.13). (C) Boxplot of chromosome specific telomere lengths across the 104 A. thaliana ecotypes. Note for chromosome 2L and 4L the majority of the ecotypes did not have reads aligning to those chromosome ends, which prevented the estimation of their chromosome specific telomere length with Topsicle.

    Journal: bioRxiv

    Article Title: Topsicle: a method for estimating telomere length from whole genome long-read sequencing data

    doi: 10.1101/2025.07.10.664126

    Figure Lengend Snippet: Application of Topsicle on A. thaliana long read sequencing dataset. (A) Distribution of telomere length estimates from analyzing whole genome long read sequencing data of 104 A. thaliana ecotypes. Red dotted line indicates the median (2815 bp). (B) Scatter plot of telomere lengths from 31 ecotypes that have telomere length estimates from TRF and Topsicle. Pearson’s r is shown in the upper-left side and the line of best fit is shown in red line (TRF telomere length = 1.71 ξ Topsicle telomere length – 1118.13). (C) Boxplot of chromosome specific telomere lengths across the 104 A. thaliana ecotypes. Note for chromosome 2L and 4L the majority of the ecotypes did not have reads aligning to those chromosome ends, which prevented the estimation of their chromosome specific telomere length with Topsicle.

    Article Snippet: Methods such as TelSeq [ ], K-seek [ , ], Computel [ ], Telomerecat [ ], and TelomereHunter [ ] have been used to analyze short read sequencing data (usually generated from the Illumina sequencing platform) for estimating the telomere length of a sample using an approach analogous to a qPCR based method.

    Techniques: Sequencing

    Application of Topsicle on maize long read sequencing dataset. (A) Telomere length estimates for the 18 PacBio libraries for B73. (B) Telomere length estimates for the 16 PacBio libraries for OH43. (C) Boxplot of telomere length estimates for the 27 maize genotypes using Topsicle. The plot shows the telomere length estimates of reads from all sequencing libraries of a genotype that passed a TRC cutoff of 0.4 calculated using a 5-mer repeat sequence. For the genotypes that were previously analyzed with TRF, the telomere length estimates are indicated with a red diamond.

    Journal: bioRxiv

    Article Title: Topsicle: a method for estimating telomere length from whole genome long-read sequencing data

    doi: 10.1101/2025.07.10.664126

    Figure Lengend Snippet: Application of Topsicle on maize long read sequencing dataset. (A) Telomere length estimates for the 18 PacBio libraries for B73. (B) Telomere length estimates for the 16 PacBio libraries for OH43. (C) Boxplot of telomere length estimates for the 27 maize genotypes using Topsicle. The plot shows the telomere length estimates of reads from all sequencing libraries of a genotype that passed a TRC cutoff of 0.4 calculated using a 5-mer repeat sequence. For the genotypes that were previously analyzed with TRF, the telomere length estimates are indicated with a red diamond.

    Article Snippet: Methods such as TelSeq [ ], K-seek [ , ], Computel [ ], Telomerecat [ ], and TelomereHunter [ ] have been used to analyze short read sequencing data (usually generated from the Illumina sequencing platform) for estimating the telomere length of a sample using an approach analogous to a qPCR based method.

    Techniques: Sequencing

    Application of Topsicle on Mimulus Nanopore sequencing data. (A) Co-occurrence heatmap displaying the frequency of a telomere repeat. Top shows the telomere repeat 4-mer (original telomere repeat sequence is 6-mer AAACCG) with all possible dinucleotide sequences that can be found at the end of the 4-mer. Bottom shows the telomere repeat 5-mer (original telomere repeat sequence is 7-mer AAACCCG) with all possible dinucleotide sequences that can be found at the end of the 5-mer. Frequencies were counted from Nanopore reads that aligned to chromosome ends of the reference genomes of each respective species. (B) Boxplot of telomere length estimated from Topsicle. For each species the TRF based telomere length is indicated with a red diamond. Significant differences in Topsicle estimated telomere length after a Mann Whitney U test are indicated with *** (p-value < 0.0001).

    Journal: bioRxiv

    Article Title: Topsicle: a method for estimating telomere length from whole genome long-read sequencing data

    doi: 10.1101/2025.07.10.664126

    Figure Lengend Snippet: Application of Topsicle on Mimulus Nanopore sequencing data. (A) Co-occurrence heatmap displaying the frequency of a telomere repeat. Top shows the telomere repeat 4-mer (original telomere repeat sequence is 6-mer AAACCG) with all possible dinucleotide sequences that can be found at the end of the 4-mer. Bottom shows the telomere repeat 5-mer (original telomere repeat sequence is 7-mer AAACCCG) with all possible dinucleotide sequences that can be found at the end of the 5-mer. Frequencies were counted from Nanopore reads that aligned to chromosome ends of the reference genomes of each respective species. (B) Boxplot of telomere length estimated from Topsicle. For each species the TRF based telomere length is indicated with a red diamond. Significant differences in Topsicle estimated telomere length after a Mann Whitney U test are indicated with *** (p-value < 0.0001).

    Article Snippet: Methods such as TelSeq [ ], K-seek [ , ], Computel [ ], Telomerecat [ ], and TelomereHunter [ ] have been used to analyze short read sequencing data (usually generated from the Illumina sequencing platform) for estimating the telomere length of a sample using an approach analogous to a qPCR based method.

    Techniques: Nanopore Sequencing, Sequencing, MANN-WHITNEY